Perils of parsimony: properties of reduced-rank estimates of genetic covariance matrices.
نویسندگان
چکیده
Eigenvalues and eigenvectors of covariance matrices are important statistics for multivariate problems in many applications, including quantitative genetics. Estimates of these quantities are subject to different types of bias. This article reviews and extends the existing theory on these biases, considering a balanced one-way classification and restricted maximum-likelihood estimation. Biases are due to the spread of sample roots and arise from ignoring selected principal components when imposing constraints on the parameter space, to ensure positive semidefinite estimates or to estimate covariance matrices of chosen, reduced rank. In addition, it is shown that reduced-rank estimators that consider only the leading eigenvalues and -vectors of the "between-group" covariance matrix may be biased due to selecting the wrong subset of principal components. In a genetic context, with groups representing families, this bias is inverse proportional to the degree of genetic relationship among family members, but is independent of sample size. Theoretical results are supplemented by a simulation study, demonstrating close agreement between predicted and observed bias for large samples. It is emphasized that the rank of the genetic covariance matrix should be chosen sufficiently large to accommodate all important genetic principal components, even though, paradoxically, this may require including a number of components with negligible eigenvalues. A strategy for rank selection in practical analyses is outlined.
منابع مشابه
Statistical Methods A NOTE ON BIAS IN REDUCED RANK ESTIMATES OF COVARIANCE MATRICES
Fitting only the leading principal components allows genetic covariance matrices to be modelled parsimoniously, yielding reduced rank estimates. If principal components with non-zero variances are omitted from the model, genetic variation is moved into the covariance matrices for residuals or other random effects. The resulting bias in estimates of genetic eigen-values and -vectors is examined.
متن کاملCattle and Sheep Growth REDUCED RANK ESTIMATES OF THE GENETIC COVARIANCE MATRIX FOR LIVE ULTRA-SOUND SCAN TRAITS
Multivariate restricted maximum likelihood analyses for a large data set comprising eight traits were carried out, estimating the leading 3, 4, 5 and 6 genetic principal components only. Traits were eye muscle area, percentage intra-muscular fat, and fat depth at the 12/13th rib and P8 sites, treating records on bulls and heifers or steers as different traits. The resulting, reduced rank estima...
متن کاملComputing techniques: Developments and validations SAMPLING BEHAVIOUR OF REDUCED RANK ESTIMATES OF GENETIC COVARIANCE FUNCTIONS
A simulation study investigating relative errors and sampling variances of reduced rank estimates of genetic covariance functions from random regression analyses estimating the leading principal components only, is presented. The example considered pertains to covariance functions for growth of beef cattle. It is demonstrated that the leading principal components are estimated most accurately, ...
متن کاملEstimation of genetic and phenotypic covariance functions for longitudinal or ‘repeated’ records by Restricted Maximum Likelihood
Covariance functions are the equivalent of covariance matrices for traits with many, potentially infinitely many, records in which the covariances are defined as a function of age or time. They can be fitted for any source of variation, e.g. genetic, permanent environment or phenotypic. A suitable family of functions for covariance functions are orthogonal polynomials. These give the covariance...
متن کاملMatrix Completion by the Principle of Parsimony
Dempster’s covariance selection method is extended first to general nonsingular matrices and then to full rank rectangular matrices. Dempster observed that his completion solved a maximum entropy problem. We show that our generalized completions are also solutions of a suitable entropy-like variational problem.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Genetics
دوره 180 2 شماره
صفحات -
تاریخ انتشار 2008